SPI-EM: towards a tool for predicting CATH superfamilies in 3D-EM maps.

نویسندگان

  • Javier A Velázquez-Muriel
  • Carlos O S Sorzano
  • Sjors H W Scheres
  • José-María Carazo
چکیده

In this paper the theoretical framework used to build a superfamily probability in electron microscopy (SPI-EM) is presented. SPI-EM is a new tool for determining the homologous superfamily to which a protein domain belongs looking at its three-dimensional electron microscopy map. The homologous superfamily is assigned according to the domain-architecture database CATH. Our method follows a probabilistic approach applied to the results of fitting protein domains into maps of proteins and the computation of local cross-correlation coefficient measures. The method has been tested and its usefulness proven with isolated domains at a resolution of 8 A and 12 A. Results obtained with simulated and experimental data at 10 A suggest that it is also feasible to detect the correct superfamily of the domains when dealing with electron microscopy maps containing multi-domain proteins. The inherent difficulties and limitations that multi-domain proteins impose are discussed. Our procedure is complementary to other techniques existing in the field to detect structural elements in electron microscopy maps like alpha-helices and beta-sheets. Based on the proposed methodology, a database of relevant distributions is being built to serve the community.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Assigning genomic sequences to CATH

We report the latest release (version 1.6) of the CATH protein domains database (http://www.biochem.ucl. ac.uk/bsm/cath ). This is a hierarchical classification of 18 577 domains into evolutionary families and structural groupings. We have identified 1028 homo-logous superfamilies in which the proteins have both structural, and sequence or functional similarity. These can be further clustered i...

متن کامل

Predicting the Distribution of Leucanthemum Vulgare Lam. Using Logistic Regression in Fandoghlou Rangelands of Ardabil Province, Iran

Species Distribution Modelling (SDM) is an important tool for conservation planning and resource management. Invasive species represent a good opportunity to evaluate SDMs predictive accuracy with independent data as their invasive range can expand quickly. Thus, the aim of this study was to investigate the relationships between presence of Leucanthemum vulgare Lam. and environmental v...

متن کامل

Computational methods for constructing protein structure models from 3D electron microscopy maps.

Protein structure determination by cryo-electron microscopy (EM) has made significant progress in the past decades. Resolutions of EM maps have been improving as evidenced by recently reported structures that are solved at high resolutions close to 3Å. Computational methods play a key role in interpreting EM data. Among many computational procedures applied to an EM map to obtain protein struct...

متن کامل

gEMfitter: A Highly Parallel FFT-Based 3D Density Fitting Tool With GPU Texture Memory Acceleration gEMfitter: A GPU-Accelerated 3D Density Fitting Tool

Fitting high resolution protein structures into low resolution cryo-electron microscopy (cryo-EM) density maps is an important technique for modeling the atomic structures of very large macromolecular assemblies. This article presents “gEMfitter”, a highly parallel fast Fourier transform (FFT) EM density fitting program which can exploit the special hardware properties of modern graphics proces...

متن کامل

The CATH classification revisited—architectures reviewed and new ways to characterize structural divergence in superfamilies

The latest version of CATH (class, architecture, topology, homology) (version 3.2), released in July 2008 (http://www.cathdb.info), contains 114,215 domains, 2178 Homologous superfamilies and 1110 fold groups. We have assigned 20,330 new domains, 87 new homologous superfamilies and 26 new folds since CATH release version 3.1. A total of 28,064 new domains have been assigned since our NAR 2007 d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Journal of molecular biology

دوره 345 4  شماره 

صفحات  -

تاریخ انتشار 2005